Dependence relationships between Gene Ontology terms based on TIGR gene product annotations

Author: Borgelt Christian
Kumar Anand
Smith Barry
Publication date: 01/01/2004

The Gene Ontology is an important tool for the representation and processing of information about gene products and functions. It provides controlled vocabularies for the designations of cellular components, molecular functions, and biological processes used in the annotation of genes and gene products. These constitute three separate ontologies, of cellular components, molecular functions, and biological processes, respectively. The question we address here is: how are the terms in these three separate ontologies related to each other? We use statistical methods and formal ontological principles as a first step towards finding answers to this question.

PhilPapers
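
As an illustration of the kind of statistical dependence measure the abstract above alludes to, the following minimal sketch computes a chi-square statistic for the co-annotation of two terms across a set of gene products. It is an assumption-laden example, not the authors' method: the function name `chi_square_2x2`, the input layout (a dict from gene product to a set of term labels), and the term labels themselves are hypothetical.

```python
from itertools import product


def chi_square_2x2(annotations, term_a, term_b):
    """Chi-square statistic for the 2x2 co-annotation table of two terms.

    annotations: dict mapping a gene product name to the set of term
    labels it is annotated with (hypothetical input layout).
    """
    n = len(annotations)
    # Observed counts for the four presence/absence combinations.
    observed = {combo: 0 for combo in product((True, False), repeat=2)}
    for terms in annotations.values():
        observed[(term_a in terms, term_b in terms)] += 1
    # Marginal counts for each term being present or absent.
    row = {a: observed[(a, True)] + observed[(a, False)] for a in (True, False)}
    col = {b: observed[(True, b)] + observed[(False, b)] for b in (True, False)}
    chi2 = 0.0
    for (a, b), obs in observed.items():
        expected = row[a] * col[b] / n  # count expected under independence
        if expected > 0:
            chi2 += (obs - expected) ** 2 / expected
    return chi2


# Made-up annotations: the two terms always occur together or not at all.
dependent = {"g1": {"x", "y"}, "g2": {"x", "y"}, "g3": set(), "g4": set()}
# Made-up annotations: every presence/absence combination occurs once.
independent = {"g1": {"x", "y"}, "g2": {"x"}, "g3": {"y"}, "g4": set()}
```

A value near zero suggests the two terms are annotated independently of each other, while larger values indicate dependence; for a 2x2 table the statistic reaches its maximum, the number of gene products, when the terms are perfectly associated.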

Fast Multiplex Graph Association Rules for Link Prediction

Author: Borgelt Christian
Coscia Michele
Szell Michael
Publication date: 22/11/2022

Multiplex networks allow us to study a variety of complex systems where nodes connect to each other in multiple ways, for example friend, family, and co-worker relations in social networks. Link prediction is the branch of network analysis allowing us to forecast the future status of a network: which new connections are the most likely to appear in the future? In multiplex link prediction we also ask: of which type? Because this last question is unanswerable with classical link prediction, here we investigate the use of graph association rules to inform multiplex link prediction. We derive such rules by identifying all frequent patterns in a network via multiplex graph mining, and then score each unobserved link's likelihood by finding the occurrences of each rule in the original network. Association rules add new abilities to multiplex link prediction: to predict new node arrivals, to consider higher order structures with four or more nodes, and to be memory efficient. We improve over previous work by creating a framework that is also efficient in terms of runtime, which enables an increase in prediction performance. This increase in efficiency allows us to improve a case study on a signed multiplex network, showing how graph association rules can provide valuable insights to extend social balance theory.

Comment: arXiv admin note: substantial text overlap with arXiv:2008.0835

arXiv.org e-Print Archive

Test Statistics for the Identification of Assembly Neurons in Parallel Spike Trains

Author: Christian Borgelt
David Picado Muiño
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2015

In recent years numerous improvements have been made in multiple-electrode recordings (i.e., parallel spike-train recordings) and spike sorting, to the extent that it is now possible to monitor the activity of up to hundreds of neurons simultaneously. Due to these improvements it is now potentially possible to identify assembly activity (roughly understood as significant synchronous spiking of a group of neurons) from these recordings, which, if it can be demonstrated reliably, would significantly improve our understanding of neural activity and neural coding. However, several methodological problems remain when trying to do so. A principal one is the combinatorial explosion that one faces when considering all potential neuronal assemblies, since in principle every subset of the recorded neurons constitutes a candidate set for an assembly. We present several statistical tests to identify assembly neurons (i.e., neurons that participate in a neuronal assembly) from parallel spike trains, with the aim of reducing the set of neurons to a relevant subset and thereby easing the task of identifying neuronal assemblies in further analyses. These tests improve on those introduced by Berger et al. (2010) by using additional features like spike weight or pairwise overlap and alternative ways to identify spike coincidences (e.g., by avoiding time binning, which tends to lose information).

Crossref

Directory of Open Access Journals

PubMed Central
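
The abstract above mentions identifying spike coincidences without time binning. As a rough sketch of that idea only (not the test statistics of the paper), the following counts pairs of spikes from two trains that lie within a tolerance of each other, working directly on spike times. The function name `count_coincidences`, the millisecond units, and the example spike times are assumptions made for illustration.

```python
def count_coincidences(train_a, train_b, tolerance):
    """Count spike pairs (one from each train) at most `tolerance` apart.

    Works on raw spike times, so no time bins are needed. All arguments
    use the same time unit (milliseconds are assumed in the example).
    """
    a = sorted(train_a)
    b = sorted(train_b)
    count = 0
    start = 0  # index of the first spike in b that may still match
    for t in a:
        # Skip spikes in b that are too early to match t or any later spike.
        while start < len(b) and b[start] < t - tolerance:
            start += 1
        # Count every spike in b inside the window [t - tol, t + tol].
        j = start
        while j < len(b) and b[j] <= t + tolerance:
            count += 1
            j += 1
    return count


# Made-up spike times in milliseconds.
neuron_1 = [10.0, 50.0, 100.0]
neuron_2 = [11.0, 80.0, 101.5]
n_pairs = count_coincidences(neuron_1, neuron_2, tolerance=2.0)
```

With binning, two spikes that are close in time but fall on opposite sides of a bin boundary would be missed; comparing spike times directly avoids that loss, at the cost of having to choose a tolerance.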

Efficient Identification of Assembly Neurons within Massively Parallel Spike Trains

Author: Berger Denise
Borgelt Christian
Grün Sonja
Louis Sebastien
Morrison Abigail
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2010

The chance of detecting assembly activity is expected to increase if the spiking activities of large numbers of neurons are recorded simultaneously. Although such massively parallel recordings are now becoming available, methods able to analyze such data for spike correlation are still rare, as a combinatorial explosion often makes it infeasible to extend methods developed for smaller data sets. By evaluating pattern complexity distributions the existence of correlated groups can be detected, but their member neurons cannot be identified. In this contribution, we present approaches to actually identify the individual neurons involved in assemblies. Our results may complement other methods and also provide a way to reduce data sets to the “relevant” neurons, which shortens computation time and thus allows a refined analysis of the detailed correlation structure.

CiteSeerX

Directory of Open Access Journals

PubMed Central

Juelich Shared Electronic Resources

Selecting appropriate surrogate methods for spike correlation analysis

Author: Christian Borgelt
George Gerstein
Markus Diesmann
Sebastien Louis
Sonja Grün
Publication venue: BioMed Central
Publication date: 01/01/2010

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Efficient Implementations of Apriori and Eclat

Author: Christian Borgelt

Apriori and Eclat are the best-known basic algorithms for mining frequent item sets in a set of transactions. In this paper I describe implementations of these two algorithms that use several optimizations to achieve maximum performance, w.r.t. both execution time and memory usage. The Apriori implementation is based on a prefix tree representation of the needed counters and uses a doubly recursive scheme to count the transactions. The Eclat implementation uses (sparse) bit matrices to represent transaction lists and to filter closed and maximal item sets.

CiteSeerX
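
To make the Eclat idea from the abstract above concrete, here is a minimal sketch of the basic scheme: each item is mapped to the set of transaction identifiers that contain it, and longer item sets are found by intersecting those sets depth-first. It deliberately uses plain Python sets instead of the (sparse) bit matrices the abstract describes, and it does not filter closed or maximal item sets; the function name `eclat` and the example transactions are assumptions for illustration, not the paper's implementation.

```python
def eclat(transactions, min_support):
    """Return a dict mapping each frequent item set (as a tuple of items
    in sorted order) to its support, i.e. the number of transactions
    that contain it."""
    # Vertical representation: item -> set of transaction ids.
    tidsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets.setdefault(item, set()).add(tid)

    frequent = {}

    def extend(prefix, candidates):
        # candidates: list of (item, tids), where tids is the set of
        # transactions containing every item of prefix plus this item.
        for i, (item, tids) in enumerate(candidates):
            itemset = prefix + (item,)
            frequent[itemset] = len(tids)
            # Intersect with the later items to find frequent extensions.
            suffix = []
            for other, other_tids in candidates[i + 1:]:
                common = tids & other_tids
                if len(common) >= min_support:
                    suffix.append((other, common))
            if suffix:
                extend(itemset, suffix)

    initial = sorted(
        ((item, tids) for item, tids in tidsets.items()
         if len(tids) >= min_support),
        key=lambda pair: pair[0],
    )
    extend((), initial)
    return frequent


# Made-up transactions for illustration.
baskets = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}]
result = eclat(baskets, min_support=2)
```

Because the support of an item set is just the size of an intersection, no pass over the original transactions is needed after the vertical representation is built. A bit-matrix representation, as mentioned in the abstract, would replace the set intersections with bitwise operations.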